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Abstract —Although Software-Defined Networking (SDN) en¬ 
ables flexible network resource allocations for traffic engineering, 
current literature mostly focuses on unicast communications. 
Compared to traffic engineering for multiple unicast flows, 
multicast traffic engineering for multiple trees is very challenging 
not only because minimizing the bandwidth consumption of a 
single multicast tree by solving the Steiner tree problem is already 
NP-Hard, but the Steiner tree problem does not consider the 
link capacity constraint for multicast flows and node capacity 
constraint to store the forwarding entries in Group Table of 
OpenFlow. In this paper, therefore, we first study the hardness 
results of scalable multicast traffic engineering in SDN. We prove 
that scalable multicast traffic engineering with only the node 
capacity constraint is NP-Hard and not approximable within 
6, which is the number of destinations in the largest multicast 
group. We then prove that scalable multicast traffic engineering 
with both the node and link capacity constraints is NP-Hard and 
not approximable within any ratio. To solve the problem, we 
design a 5-approximation algorithm, named Multi-Tree Routing 
and State Assignment Algorithm (MTRSA), for the first case and 
extend it to the general multicast traffic engineering problem. 
The simulation and implementation results demonstrate that the 
solutions obtained by the proposed algorithm outperform the 
shortest-path trees and Steiner trees. Most importantly, MTRSA 
is computation-efficient and can be deployed in SDN since it can 
generate the solution with numerous trees in a short time. 
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I. Introduction 

Software-defined networking (SDN) provides a new cen¬ 
tralized architecture with flexible network resource manage¬ 
ment to support a huge amount of data transmission ID- Dif¬ 
ferent from legacy networks, SDN separates the control plane 
from switches and allows the control plane to be programmable 
to efficiently optimize the network resources. OpenFlow to 
in SDN includes two major components: controllers (SDN- 
Cs) and forwarding elements (SDN-FEs). Controllers are in 
charge of handling the control plane and install forwarding 
rules based on different policies, while forwarding elements 
in switches deliver packets according to the rules specified by 
the controllers. Compared with the current Internet, routing 
paths no longer need to be the shortest ones, and the paths 
can be distributed more flexibly inside the network. It has 
been demonstrated that SDN provides a better overview of 
network topologies and enables centralized computation for 
traffic engineering for multiple unicast flows m, 0, El- 
However, multicast traffic engineering for multiple multicast 
trees in SDN has attracted much less attention in previous 
studies. 

Compared to unicast, multicast has been shown in empiri¬ 
cal studies to be able to effectively reduce overall bandwidth 
consumption in backbone networks by around 50% 0 - It 
employs a multicast tree, instead of disjoint unicast paths, from 


the source to all destinations of a multicast group, in order 
to avoid unnecessary traffic duplication. The current Internet 
multicast standard, i.e., PIM-SM 0, employs a shortest- 
path tree to connect the source and destinations, and traffic 
engineering is difficult for PIM-SM since the path from the 
source to each destination is the shortest one. A shortest-path 
tree tends to lose many good opportunities to reduce the band¬ 
width consumption by sharing more common edges among 
the paths to different destinations. In contrast, to minimize 
the bandwidth consumption, a Steiner tree (ST) J7] in Graph 
Theory minimizes the number of edges in a multicast tree. 
Nevertheless, ST only focuses on the routing of a multicast 
tree, instead of jointly optimizing the resource allocations of 
all trees. Therefore, when the network is heavily loaded, a link 
will not be able to support a large number of STs that choose 
the link. Most importantly. Group Table of an SDN-FE will 
be insufficient to store the forwarding entries of the STs due 
to the small TCAM size J8) . 

Compared to the shortest-path routing in unicast, unicast 
traffic engineering in SDN is more difficult to aggregate 
multiple flows in Flow Table of an SDN-FE, and the scalability 
has been regarded as a serious issue in the deployment of 
SDN for a large network 0, HI- The scalability problem 
for multicast communications is even more serious since 
the number of possible multicast groups is 0(2"), where n 
is the number of nodes in a network, and the number of 
possible unicast connections is 0(n 1 2 ). To remedy this issue, a 
promising way is to exploit the branch forwarding technique 
11101 . Ill II . |fl2| . which stores the multicast forwarding entries 
in only the branch nodes, instead of every node, of a multicast 
tree, where a branch node in a tree is the node with at least 
three incident edges. The branch forwarding technique can 
remedy the multicast scalability problem since packets are 
forwarded in a unicast tunnel from the logic port of a branch 
node in SDN-FE ID to another branch node. In other words, 
all nodes in the path exploit unicast forwarding in the tunnel 
and are no longer necessary to maintain a forwarding entry 
for the multicast group. Furthermore, when a branch node is 
not multicast capable for a tree (ex. Group Table is full in 
this paper), local unicast tunneling from a ne arby multicast 
capable node has been proposed in MBONE (T3]| and PIM- 
SNfl to allow multiple unicast tunnels to pass through the 
branch node to other nodes in the tree (an example will be 
presented later in this section). Nevertheless, compared to 
multicast, it is envisaged that local unicast tunneling will incur 
more bandwidth consumption since duplicated packets will be 
delivered in a link. Therefore, there is a trade-off between the 
link capacity and node capacity, because each branch node 


1 http://www.cisco.com/c/eii/us/td/docs/ios-xml/ios/ipmulti _pim/configuration/xe- 

3s/imc-pim-xe-3s-book/imc_tunnel.html 



can act as either a branch state node with the corresponding 
multicast forwarding entry stored in Group Table or a branch 
stateless node that exploits the unicast tunneling strategy. 

In comparison with the ST problem, scalable multicast traf¬ 
fic engineering, which jointly allocates the network resources 
for multiple trees, is much more challenging because both the 
link capacity and node capacity constraints are involved in 
the problem. The link capacity constraint states that the total 
rate of all multicast trees on each link should not exceed the 
corresponding link capacity, while the node capacity constraint 
ensures that Group Table of each node is sufficiently large to 
support the multicast trees with the node as a branch state node. 
Moreover, scalable multicast traffic engineering with branching 
forwarding and unicast tunneling techniques is able to allocate 
the network resources more flexibly. When Group Table of a 
node is full, unicast tunneling moves the resource requirement 
from the node to its incident links, whereas the rerouting of 
the tree is also promising by exploiting the resources of the 
nearby nodes and links. Therefore, it is necessary for scalable 
multicast traffic engineering to carefully examine both the 
routing and the allocation of the branch state nodes of all 
multicast trees. In this paper, we explore the Scalable Multicast 
Traffic Engineering (SMTE) problem for SDNs. Given the 
data rate requirement of each multicast tree, SMTE aims to 
minimize the total bandwidth cost of all trees, by finding a 
tree connecting the source and destinations of each group and 
assigning the branch state nodes for each tree, such that both 
the link capacity and node capacity constraints can be ensured. 


Fig. H] presents an illustrative example. Fig. 1 1 (a)| is the orig¬ 
inal network with the unit bandwidth cost specified beside each 
link. The bandwidth cost of each link is the total bandwidth 
consumption of the link multiplied by the unit bandwidth cost. 
The node capacity of each node is 1. The link capacity of edge 
e SjQ is 1, and the link capacities of the other edges are oo in 
this example. There are two multicast trees with both flow rates 
as 1. The source of the first tree is si = s, and its destination 
set is D\ = {di, d 2 ,..., dr}. The source of the second tree is 
s 2 = s, and its destination set is £>2 = {d[, d 2 , ■ ■ ■, d’ 7 }. Fig. 
|l(b)| shows the first shortest-path tree (blue) and the second 
shortest-path tree (red). The branch nodes and branch state 
nodes of the first tree are {c, it, v } and {it}, respectively. The 
branch nodes and branch state nodes of the second tree are 
{c, i>} and {c, u}, respectively. Note that v is not assigned as a 
branch state node of the first tree, and it thus needs to exploit 
unicast tunneling to d fj and d 7 directly. Therefore, traffic of 
the first tree are duplicated in edge e Uj „. On the other hand, 
if v was assigned as a branch state node for the first tree, 
traffic duplication in e,,,, would be more serious for the second 
tree since v has three downstream nodes d' 5 , d' 6 , d' 7 . T he tot al 
bandwidth cost of the two shortest-path trees in Fig. |l(b)| is 
99. 


Afterward, Fig. |l(c)| shows the first Steiner tree (blue) and 
the second Steiner tree (red), and the branch state nodes of 
the two trees are also {it} and {c, v} , resp ectively. The total 
bandwidth cost of the trees in Fig. |l(c)| is 103. Note that 
the total bandwidth cost in Steiner trees is higher since the 
assignment of bra nch state nodes are not carefully examined. 
Finally, Fig. 1(d) presents the first tree (blue) and the second 
tree (red) in SMTE with the same branch state nodes s pecifi ed 
above. The total bandwidth cost of the trees in Fig. |l(c)| is 
79, and here u is directed connected to d\, d 2 , anti 1/3 to 
avoid unicast tunneling, even though the edge cost is higher 
(i.e., 2 ) compared to the cost (i.e., 1 ) of the edge from v in 
the first Steiner tree. Therefore, this example manifests that it 
is necessary to consider the tree routing and the assignment 
of branch state nodes of multiple trees jointly for scalable 




Fig. 1. Comparison of different strategies for multicast traffic engineering 


multicast traffic engineering. 

SMTE is very challenging. The ST problem is NP-Hard 
but can be approximated within the ratio 1.55 m and is thus 
in APX of Complexity Theory. In other words, there exists 
an approximation algorithm for ST that can find a tree with 
the total cost at most 1.55 times of the optimal solution. In 
contrast, we first prove that SMTE-N (i.e., SMTE with only 
the node capacity constraint, while the link capacity constraint 
is relaxed) is NP-Hard but cannot be approximated within 6, 
which denotes the number of destinations of the largest mul¬ 
ticast group. Afterward, we prove the SMTE (i.e., with both 
the link and node capacity) cannot be approximated within 
any ratio. To solve SMTE-N, we propose a ^-approximation 
algorithm, named Multi-Tree Routing and State Assignment Al¬ 
gorithm (MTRSA), which can be deployed in SDN-C. MTRSA 
includes two phases: Multi-Tree Routing Phase and State-Node 
Assignment Phase, to effectively minimize the total bandwidth 
cost of all trees according to the node capacity constraint. 
We first focus on the node capacity (i.e., SMTE-N), instead 
of the link capacity, because the scalability in Group Table 
is unique and crucial for SDN and has not been explored in 
previous studies of multicast tree routing for other networks. 
Since no (A 1 “^-approximation algorithm exists in SMTE- 
N for arbitrarily small e > 0, MTRSA achieves the best 
approximation ratio. Afterward, we extend MTRSA to support 
SMTE with the link capacity constraint. 

The rest of this paper is organized as follows. Section 2 
introduces the related work. Section 3 and 4 formulate SMTE 
with Integer Programming and describe the hardness results. 
We present the algorithm design of MTRSA in Section 5, and 
Section 6 shows the simulation and implementation results on 
real topologies. Finally, Section 7 concludes this paper. 

II. Related Work 

The issues of traffic engineering for unicast traffic in SDN 
have attracted a wide spectrum of attention in the literature. 
Sushant et al. ed developed private WAN of Google Inc. 
with the SDN architecture. Qazi et al. ESI designed a new 
system in SDN to control the middleboxes, and Mckeown et 
al. E] studied the performance of OpenFlow in heterogeneous 
SDN switches. Agarwal et al. El presented unicast traffic 
engineering in an SDN network with only a few SDN-FEs, 






















while the other routers in the network followed a standard 
routing protocol, such as OSPF. However, the above studies 
focused on only unicast traffic engineering, and multicast 
traffic engineering for multiple multicast trees in SDN has 
attracted much less attention. 

To support the multicast communications, the current mul¬ 
ticast routing standard PIM-SM (6) relies on unicast routing 
protocols to discover the shortest paths from the source to the 
destinations for building a shortest-path tree (SPT). However, 
SPT is not designed to support traffic engineering. Although 
the Steiner tree (ST) (3 minimizes the tree cost and the volume 
of traffic in a network, ST is computationally intensive and 
is not adopted in the current Internet standard. Overlay ST 
HU, G3D, on the other hand, presents an alternative way 
to construct a bandwidth-efficient multicast tree in the P2P 
environment. However, the path between any two P2P clients 
is still a shortest path in Internet, and it is, therefore, difficult 
to optimize the routing of the P2P tree. Most importantly, both 
SPT and ST are designed to find the routing of a tree, instead 
of jointly optimizing the resource allocation of multiple trees. 

Flow table scalability is crucial to support large-scale SDN 
networks due to the limited TCAM size. Kanizo et al. a, who 
showed that the major bottleneck in SDN is the restricted table 
sizes, proposed a framework called Patette to decompose a 
large SDN table and distribute its entries across a network. 
Leng et al. GO proposed a flow table reduction scheme 
(FTRS) to reduce flow table usage with omnipotent controller 
functions. DIFANE f8] distributed the flow entries to multiple 
SDN switches. Zhang at al. da built a multicast topology 
(single backbone tree) for NFV, while Craig et al. adjusted 
the link weights for shortest-path trees in SDN OTI . Huang 
et al. also tried to optimize the routing of single multicast 
tree in SDN (22), [.23j. Nevertheless, the above studies were 
not designed for minimizing the total resource consumption 
in multicast traffic engineering with multiple trees subject to 
both the node and link capacity constraints in SDN. 

III. Problem Formulation 

In this paper, we explore the Scalable Multicast Traffic 
Engineering (SMTE) problem for SDN. Given the data rate 
requirement of each multicast group, SMTE aims to minimize 
the total bandwidth consumption of all multicast groups in the 
network, by finding a tree connecting the source and destina¬ 
tions of each group, and assigning the branch state nodes for 
each tree, such that the number of multicast forwarding states 
will not exceed the size of Group Table in each node, and the 
total multicast flows on each edge will not exceed the link 
capacity. Note that a branch node can only facilitate unicast 
tunneling for a multicast group if it is not assigned as a branch 
state node in the corresponding multicast tree. 

More specifically, given a network G(V,E), where V and 
E denote the set of nodes and directed edges, respectively, let 
b v denote the maximal number of branch state nodes that can 
be maintained by node (Q. Let Nf (N~) denote the set of 
out-neighbor (in-neighbor) nodes of v in G. Node u is in Nf 
(N~) if e„ jJX ( e u>v ) is a directed edge from v to u (from u to 
v) in E, and c u , v is the capacity of e u , v , while k uv is the unit 
bandwidth cost of e U) „. Let T = ... ,T t ) denote the 

set of multicast trees, while Sj acts as the root of tree 7’, £ T, 
i.e., the source with data rate and the destination set IX, 
contains the set of destinations in 7) £ T. In the following, 
we first formally define SMTE, while the derivation of the 


-In the following, we first assume that the memory size allocated in Group 
Table to maintain the branch state node of each multicast tree is the same, 
and later we extend it to the general scenario that supports different memory 
sizes for different multic ast trees according to the degrees of the node in the 
trees [l) in Section |MCl 


bandwidth consumption will be explained later in this section 
in the proposed Integer Programming formulation. Dynamic 
group members hip w ith user join and leave will be discussed 
later in Section IV-CI 

Definition 1. For network G(V, E) and multicast groups T, 
SMTE is to find the routing of each tree 7j in T spanning Si 
and Di and assign the branch state nodes in T t to minimize 
the total bandwidth cost, such that each node u acts as the 
branch state nodes of at most b u trees, and total multicast 
bandwidth consumption in each edge e u , v is at most c Ui „. 

In the following, we present the Integer Programming (IP) 
formulation for SMTE. SMTE includes the following binary 
decision variables to find the routing of each multicast tree 
and the assignment of branch state nodes. Let binary variable 
Tti,d,u,v denote if edge e u<v is in the path from Si to a 
destination node d in Di in 7). Let integer variable £i, uv 
denote the number of times that each packet of 7) is sent in 
edge e u<v via multicast (once) or unicast tunneling (multiple 
times according to the number of tunnels). Let binary variable 
0 itV . denote if v is a branch state node in 7). Intuitively, when 
we are able to find the path from s, to each destination node d 
of Ti with 7 Ti y d, u ,v = 1 on every edge e u , v in the path, together 
with the set of state branch nodes ff :V , the routing of the tree 
(the set of edges e Uj „ with £i, u ,v > 1) can be constructed 
according to the paths from Si to all destination nodes in I),. 

The objective function of the IP formulation for SMTE is 
as follows. 

min E E /*x^x £w) . 

l<i<t e U)V £E 

The objective function minimizes the total bandwidth cost of 
all multicast trees. For each tree Ti, the following constraints 
first describe the routing assignment (i.e., TTi,d,uv ) f° r the 
path connecting the source Si and each destination in I) r . 
Afterwards, we assign the branch nodes (i.e., in different 
nodes and then derive the bandwidth consumption (i.e., £i, u ,v) 
of via multicast and unicast tunneling. 


y. 7T i,d,Si,v y 71 i,d,v,Si — 1, VI A i < t, d £ Di, 

v&Ns I 

7 ^i,d,u,d 7 Ti d,d,u — L VI A i fC t, d £ Di, 

uGN~ u£N+ 

yi 7 ^i,d,v,u 7T i^d,u,,v > 

v£N£ vGNf 

VI < i < t,d £ Di,u £ V,u f d,u f Si, 

7^i,d,u,v 0 £i^u,V) < 1 < t,d £ Di,Ve UtV £ E, 

|Llj| X 0i,u T £ i,u,v G y 

VI < i < t,u £ V,u f Si, 
y 0i,u < bu, Vit e V, 

1 <i<t 

yi fi X £i,u,v Xk Cu,vi ^£u,v ^ 77. 
l<i<t 


( 1 ) 

( 2 ) 

(3) 

(4) 

(5) 

(6) 
(7) 


The first three constraints, i.e., (1), (2), and (3), are the 
flow-continuity constraints for each tree Ti to find the path 
from Si to every destination node d in Di. More specifically. 
Si is the source node, and constraint (1) states that the net 
outgoing flow from Si is one, implying that at least one 
edge eifrom s, to any neighbor node v needs to be 
selected with 7 Ti,d,si,v = 1- Note that here decision variables 
7 Ti t d, Si ,v and 7 Ti,d, v ,si are t wo different variables because the 
flow is directed. On the other hand, every destination node d 
is the flow destination, and constraint (2) ensures that the net 
incoming flow to d is one, implying that at least one edge 





ei tUt d from any neighbor node u to d must be selected with 
7r i,d,u,d = 1- For every other node u, constraint (3) guarantees 
that u is either located in the path or not. If u is located in the 
path, both the incoming flow and outgoing flow for u are at 
least one, indicating that at least one binary variable TTi t d, v ,u 
is 1 for the incoming flow, and at least one binary variable 
ni,d,u,v is 1 for the outgoing flow. Otherwise, both Tr^d,v,u 
and TTi t d,u,v are 0. Note that the objective function will ensure 
that ni t d,v,u = 1 for at most one neighbor node v to achieve 
the minimum bandwidth consumption. In other words, both 
the incoming flow and outgoing flow among u and v cannot 
exceed 1. 

Constraints (4) and (5) are formulated to find the routing 
of the tree and its corresponding branch state nodes, i.e., £i u ,v 
and Constraint (4) states that £i tU , v is at least 1 if edge 
e u>v is included in the path from to at least one d, i.e., 
TTi t d, u ,v = 1- The tree 7) is the union of the paths from s t to 
all destination nodes in Di. Constraint (5) is the most crucial 
one. For each node u in Tj, if it is not a branch state node, 
i.e., /3i >u = 0, u does not maintain a forwarding entry of I) 
in Group Table and thereby facilitates unicast tunneling. In 
this case, constraint (5) and the objective function guarantee 
that the number of packets received from an incoming link 
e v _ u must be the summation of the number of packets sent 
to every outgoing link e UtV . By contrast, when 3 lAl = 1, 
constraint (5) becomes redundant because the Left-Hand-Side 
(LHS) is smaller than 0 and thereby imposes no restrict on the 
Right-Hand-Side (RHS). In this case, constraint (4) ensures 
that £i tV>u = 1 for every incident edge e VtU with TTi t d, v ,u as 1- 
Therefore, u is multicast capable for Tj, and each packet is 
delivered once in every incident link. 

The last two constraints are capacity constraints. Constraint 
(6) states that each node u can act as a branch state node of at 
most b u trees in T, while constraint (7) describes that the total 
multicast bandwidth consumption of in each directed edge e v _ v 
cannot exceed c UA! . 

IV. Hardness Results 

In the following, we first show that SMTE-N is very 
challenging in Complexity Theory by proving that it is NP- 
Hard and not able to be approximated within 6 C for every 
c < 1, where 6 = maxi<j< t \Di\. Afterward, we prove that 
SMTE cannot be approximated within any ratio. 

The Steiner tree problem is a special case of SMTE- 
N. However, SMTE-N is much more challenging than the 
Steiner tree problem because the Steiner tree problem can be 
approximated within ratio 1.55 and is thus in APX in Com¬ 
plexity Theory. In contrast, we find out that SMTE is much 
more difficult to be approximated. The following theorem first 
proves that SMTE-N cannot be approximated within 5 C for 
every c < 1, where 5 = maxi<j< t \Di\, by a gap-introducing 
reduction from the 3SAT problem. 

Theorem 1. For any e > 0, there exists no 

{5 l ~ e )-approximation algorithm for SMTE-N, where 
<5 = maxi<j< t \Di\, assuming P NP. 

Proof: We prove the theorem with the gap-introducing 
reduction from the 3SAT problem. 

The 3SAT problem is a simplification of the regular SAT 
problem. An instance of 3SAT is a conjunctive normal form 
(CNF) in which each clause contains exactly three variables. 
The 3SAT problem is to decide, given a Boolean expression cf> 
in CNF such that each clause contains exactly three variables, 
whether <j> is satisfiable. 

For any instance <j> of the 3SAT problem, we build an 
instance G( V. E) of SMTE-N with two multicast trees, where 


the destination sets are D i and D 2 . Let OPT(G) denote 
the optimal solution of G for SMTE-N. The goals of the 
reduction are two-fold. 1) If <j> is satisfiable then OPT(G) < 
Ap q+l . 2) If <f> is not satisfiable then OPT(G) > (4 p q+1 ) x 
(max{|C-[ |, |Z? 2 1}) 1—e - In the above two goals, n is the number 
of Boolean variables in <j>, m is the number of clauses in cf>, 
p = maxjm, n} and q is a large number (derived later). 

To achieve the above goals, we build the instance of SMTE- 
N from each instance of the 3SAT problem as follows. Given 
an instance <j> of 3SAT with n Boolean variables x±,... ,x n 
and m clauses Gi,...,G m , we construct a directed graph 
G(V, E) in the following way. 1) The node set V is partitioned 
into four node sets {s}, U, D\, and D 2 . 2) U includes 2 n 
nodes ui,ui,u 2 ,U 2 ,... ,u ni uH (nodes Ui and ui correspond 
to the Boolean variable xf), and for each i with 1 < i < n, 
there are directed edges ( s,Ui ) and (s,ui). 3) D\ has mp q 
nodes d- ’, where 1 < j < to and 1 < k < p q (nodes d^ , 
1 <k< p q , corresponding to p q copies of the clause Cf), and 
there exists a directed edge ( Ui,d (( ui,d ^)) if and only 
if the variable Xi ($7, resp.) appears in G,. 4) D 2 contains 
(k) 

np q nodes iv) , where 1 < i < n and 1 < k < p q , and 

there are directed edges ('Ui, u } j k ' 1 ) and ( ui,w for each i,k 
with 1 < i < n and 1 < k < p q . Note that G only has the 
directed edges described above, p = maxjm, n}, and q is the 
smallest integer such that q > (3 + log p 4)/e. Fig. [2] presents 
an illustrative example of an SMTE-N instance. 

The cost of each edge from s to U is set as p q , and the cost 
of the other edges are set to be 1. The capacity of each node 
is set as 1. Let s and D\ be the source node and destination 
set of Ti respectively, and let s and D 2 be the source node 
and destination set of T 2 respectively. 

If <f> is satisfiable, there is a truth assignment to x t such 
that (j> is true. Let A = {ui : Xi is assigned to be true} U {ui : 
Xi is assigned to be false}. Consider the tree T\ rooted at s 
that includes 1) the edges between s and A, and 2) the edges 
between d!'^' 1 and one of its neighbor in A (the existence of 
its neighbor in A comes from that <f> is satisfiable). Consider 
the tree T 2 that includes 1) the edges between s and U\A, 2) 
the edges between U\A and D 2 . Then (7), If) is a feasible 
solution of SMTE-N, and it can act as an upper bound of 
SMTE-N in G. The total edge cost of 7\ is np q + mp q , and 
the total edge cost of T 2 is np q + np q . Since the node capacity 
is sufficient, the total bandwidth cost of this feasible solution 
is 3 np q + mp q < 4 p q+1 . Hence, we have OPT(G) < 4 p q+1 . 

On the other hand, if <fi is not satisfiable, let (7j, r J' 2 ) be any 
feasible solution. For 1 < k < p q , let A be the set consisting 
of every i with 1 < i < n, such that u,; and uf are adjacent 
to some nodes in {d^ :> : 1 < j < to} along the edges in 
7). Since & is not satisfiable, /;. is not empty for each k with 
1 < k < p q . By pigeonhole principle J24), there exists at least 
one i* with 1 < i* < n such that i* is in at least 2- sets of 

— — p 

{/i, I 2 ,..., I p i}. In Ti, therefore, m* has at least — > P q 
downstream destination nodes, and uff has at least >P q 1 
downstream destination nodes. On the other hand, in T 2 , 
and need to dominate p q downstream destination nodes of 
T 2 . If m* or ui* is not a branch state node in T\, then the 
total cost is at least p 2?_1 . On the other hand, if Ui* and ui* 
both are branch state nodes in Ti, since the capacity of node 
Ui* and ui~ are 1, neither u,* nor ui* are branch state nodes 
in T 2 . The total cost is at least p 2q . 

Therefore, the total cost of the optimal solution is 
larger than p 2<?_1 , and we have OPT(G) > p 2q ~ l = 

q — 2 —log„ 4 

(Ap q+1 )(p q ~ 2 ~ los r 4 ) = (4 p q+1 )(p q+1 ) 5+i - 




Fig. 2. An illustration of instance building from 3SAT to SMTE-N 


3+logp 4 

(4 p q+l )(p q+1 ) 1 5+T - > (4 p q+1 ){p q+1 ) 1 ~ t > 

(4p 9+1 )(max{|Di|, |£> 2 |}) 1_e - Since e can be arbitrarily 
small, for any e > 0, there is no (max{|Z?i|, |f? 2 |}) 1_e 
approximation algorithm for SMTE-N, assuming P f NP. 
The theorem follows. ■ 

In the following, we prove that SMTE cannot be approxi¬ 
mated within any ratio. 

Theorem 2. For any polynomial time computable function 
f, SMTE cannot be approximated within a factor of /(|H|), 
unless P = NP. In other words, for arbitrary positive integer 
k, SMTE cannot be approximated within \V\ k . 


Let s and D i be the source node and destination set of T \, 
respectively Let s and Z? 2 be the source node and destination 
set of T 2 respectively. 

If <f> is satisfiable, there is a truth assignment to Xi such that 
< f> is true, let W = {ui : Xi is assigned to be true} U {ui : Xi is 
assigned to be false}. Consider the tree T\ rooted at s including 

1) the edges between s and W and 2) the edge between each 
dj and one of its neighbor in W (the existence of its neighbor 
in W comes from that <b is satisfiable). Consider the tree T 2 
which includes 1) the edges between s and U\W and 2) the 
edges between U \ W and Z) 2 . Then (Ti,T 2 ) is a feasible 
solution of SMTE and it can act as an upper bound of SMTE 
in G. The total edge cost of T\ is to + n and the total edge 
cost of 7) is 2 n. Since the node capacity is sufficient, the 
total bandwidth cost of this feasible solution is to + 3 n. Hence 
OPT(G) < to + 3n. 

On the other hand, if 0 is not satisfiable, let (Tj , If) be 
any feasible solution. Since 0 is not satisfiable, there is an i 
such that both edges ( s,Ui ) and (s,ui) appear in T\ for any 
feasible solution of SMTE, in order to span all destinations in 
D i. Therefore, the edges (s, uf) and ( s,ui ) cannot be included 
in T 2 due to the link capacity constraint, and T 2 thereby needs 
to choose the directed edge (s, d'f). The total edge cost of 
T 2 is at least (to + 3n) x /(|F|) in this case, and the total 
bandwidth cost of the optimal solution in SMTE is greater than 
(TO + 3n) x f(\V\). Therefore SMTE cannot be approximated 
within a factor of f(\V\), unless P = NP. ■ 


Proof: Assume, for a contradiction, that there is a polyno¬ 
mial time approximation algorithm A with the approximation 
raio f(\V\) for SMTE. This proof will show that A can be 
used for deciding the 3SAT problem in polynomial time, thus 
implying P = NP. 

Specifically, given a graph G, let OPT(G) denote the 
optimal solution of G for SMTE. For any instance of the 3SAT 
problem, we build an instance G(V. E) for SMTE with two 
multicast trees with the destination sets D\ and Z) 2 . The goals 
of the reduction are two-fold: 

1) if <f> is satisfiable then OPT(G) < to + 3 n, and 

2) if (f> is not satisfiable then OPT(G) > (TO + 3n) x f(\V\), 

where n is the number of Boolean variables, to is the 
number of clauses, and / is a polynomial-time computable 
function. 

To achieve the above goals, we build the instance of SMTE 
from each instance of the 3SAT problem. Given an instance 0 
of 3SAT with n Boolean variables xi,... ,x n and to clauses 
Gi,..., C m , we construct a directed graph G( V. E ) such that 

1) the node set V is partitioned into four node sets s, U, 
D i, and Z? 2 ; 

2) U contains 2 n nodes ui,ui 1 U 2 ,U 2 , ■ ■ ■ ,u n ,uf (nodes m 
and ui are corresponding to the Boolean variable xf), and 
for each i with 1 < i < n, there are directed edges (s, uf) 
and (s,u“); 

3) D\ contains to nodes d\, , d m (node dj corresponds to 

the clause Cj), and there exists a directed edge ( Ui,dj ) 
((uf, dj)) if and only if the variable Xi (xi, resp.) appears 
in Cj- 

4) D 2 contains n nodes d[,..., d' n and for each i with 

1 < f < n, there are directed edges ( s,d [), and 

(W, <;); 

5) G only has the directed edges described above. 

The cost of each edge from s to D 2 is set as (m + 3 n) x 
/(|H|) , and the cost of every other edge is set to be 1. The 
capacity of each directed edge is set to be 1, and the data rate 
of each tree is also 1. The node capacity is set as 2. 


V. Algorithm Design 

In the following, we first propose a 5-approximation al¬ 
gorithm, named Multi-Tree Routing and State Assignment 
Algorithm (MTRSA). for SMTE-N, where S = maxi<j< t |A|. 
Note that we first focus on the node capacity, instead of the link 
capacity, because the scalability in Group Table is crucial in 
SDN and has not been explored in previous studies of multicast 
tree routing for other networks. Since Theorem |T| proves that 
there is no (^“^-approximation algorithm of SMTE-N for 
any e > 0, MTRSA achieves the best approximation ratio. 
Afterward, we extended it to support SMTE. 

A. Algorithm Description 

MTRSA includes two phases: 1) Multi-Tree Routing Phase 
and 2) State-Node Assignment Phase. Multi-Tree Routing 
Phase first constructs an initial multicast tree for each multicast 
group to minimize the total bandwidth consumption and bal¬ 
ance the distribution of branch nodes in different trees. State- 
Node Assignment Phase then finds the branch state nodes for 
each multicast tree to follow the node constraint. 

1) Multi-Tree Routing Phase: Initially, Multi-Tree Rout¬ 
ing Phase constructs a shortest-path tree with source Si and 
destination set Di for each tree Ti € T. A node u is full if 
it acts as a branch node for h u multicast trees. By contrast, 
u is overloaded if it acts as a branch node for more than 
b u trees. In this case, u needs to act as a branch stateless 
node for some of those trees and thereby will incur more 
bandwidth consumption. To address this issue, after finding the 
shortest-path trees, if there is an overloaded node, we adjust 
the local tree routing nearby the overloaded node to move the 
branch node to another node that has not been full, in order 
to balance the distribution of branch nodes among different 
multicast trees. 

More specifically, if any node u is an overloaded node and 
a branch node in any tree 7), MTRSA chooses a node v of T, 
such that: 1) v is a downstream to u in Ti, 2) v is a branch node 
or a destination node of Ti, and 3) there is no other branch 
node or destination node in the path from u to v in Ti. In 
other words, v is a nearby downstream branch node of u and 




a destination node. MTRSA reroutes the path (from u to v) 
to another path (from w to v) as follows, in order to alleviate 
the storage load in u. Let i denote the total bandwidth cost of 
the path from u to v in A). We find a new path from w to v 
such that: 1 ) the total cost of the new path is at most l, 2 ) the 
new path does not pass through any exiting node in 7',, and 3) 
this new path starts from an on-tree node w such that i) it is 
not a leaf node of A), and ii) it is not full or overloaded. We 
update tree T: by substituting the old path from u to v in 7', 
with the new path from w to v, and the overload situation in 
u can be alleviated accordingly. Afterward, we process every 
other downstream branch node v of u until u is no longer a 
branch node for Ti. The above process is repeated for every 
tree TV iteratively until u is no longer overloaded. 

Example. Consider the following example in Fig. |3(a)| 
Let G(V. E ) be the network with two multicast trees T[ ana 
T 2 with the data rate as 1. The number on each edge is the 
unit bandwidth cost of this edge, and the node capacity of 
each node is 1. The source si of the first tree Ti is s with the 
corresponding destination set D\ = {d\, cfe,..., ds}, while 
the source S 2 of T 2 is also s, but the destination set is D 2 = 
{d' 1; d' 2 , ■ ■ ., d' e }. In Multi-Tree Routing Phase, we first find 
the blue and red shortest-path trees T\ and T 2 in Fig. |3(b)| 
Afterward, we adjust the multicast trees for overloaded nodes. 
Specifically, the node capacity of a is 1, but a is a branch node 
of both 7j and T 2 . Therefore, a is an overloaded node, and 
MTRSA examines nodes di,d 2 ,v,c, which are downstream 
nodes of a in T\. MTRSA first reroutes the path {a. b. c} in 
tree T\. Since node y is overloaded, node c cannot be rerouted 
from node y. In contrast, node v is a full branch node of T\, 
and MTRSA reroutes node c from node v for 7j as shown in 
Fig. |3(c)| Note that the bandwidth cost is efficiently reduced 
since the new path from v to c is much smaller than the one 
from a to c. Therefore, Multi-Tree Routing Phase addresses 
both the node capacity and the bandwidth consumption for 
scalable multicast traffic engineering. 

2) State-Node Assignment Phase: It is worth noting when 
the network is heavily loaded, the first phase may not be 
able to ensure that every overloaded node can be successfully 
adjusted to balance the distribution of branch nodes in different 
trees, and State-Node Assignment Phase is crucial in this 
case to minimize the increment of bandwidth consumption 
due to unicast tunneling through branch stateless node. More 
specifically, State-Node Assignment Phase includes two stages: 
1) Greedy Assigning Stage, and 2) Local Search Stage. Greedy 
Assigning Stage assigns the branch state nodes by iteratively 
maximizing the redu ction of the number of branch state nodes, 
and later in Section IV-BI we prove that the number of branch 
state nodes reduced by the Greedy Assigning Stage is at least 
half of the number of branch state nodes reduced by an optimal 
strategy. Local Search Stage then improves the solution by 
further alleviating the assignment on overloaded nodes and 
rerouting the trees to further reduce the total bandwidth cost. 
We detail the two stages as follows. 

For each multicast tree Ti obtained in Multi-Tree Routing 
Phase, let Wi denote the set of branch nodes in Tj, and W = 
U i <,;</: W, . On the other hand, let A,; be the set of branch state 
nodes in T to be decided in this phase, and A, thereby is a 
subset of Wi . Let c(Tj, A,) denote the total bandwidth cost of 
Tj with the set of branch state nodes as A, . More precisely, 
c(Ti,Ai ) = Y.v&AiUDi c ( p v), where P v is the path from the 
closest upstream branch state node in A,; or the source to v, 
such that all internal nodes of P v are not in A,, and c(P v ) is 
the cost of all edges in P v . In other words, if there is no branch 
stateless node in P v , every packet is delivered only once on 
every link of P v . By contrast, if P v includes a branch stateless 
node u, each packet is sent multiple times on the links from 


the closest upstream branch state node to u, corresponding to 
the unicast tunneling case. 

An assignment A of branch state nodes can be defined as 
follows: A is a 0,1-matrix with the rows indexed by {1 ,..., t} 
and columns indexed by W, such that 1) the l’s in row i can 
only be the columns indexed in Wi, and 2) the number of l’s 
in column w £ W is no more than the node capacity b w . We 
assign a branch state node w £ W to tree Tj if and only if the 
(i, w) entry of A is 1. In other words, the first condition ensures 
that a branch state node can only be assigned to a branch 
node of Tj, while the second condition is the node capacity 
constraint. Given an assignment A of branch state nodes, let 
Aj = {u> £ W : the ii,w) entry of A is 1} denote the set of 
branch state nodes for Ti, and the total bandwidth cost for the 
set T of all multicast trees with the state-node assignment A is 
c(T, A) = Xa<i<t c (Ti, Aj). Since an assignment A of branch 
state nodes can also be regarded as a subset of N, where N = 
{1,..., t} x W, let AA be the family of subsets of N satisfying 
the above two conditions (hence AA is the family of all feasible 
assignments of branch state nodes to T), and we use c(T, 0) 
to denote the total bandwidth cost of T without assigning any 
branch state node. Now let the set function 2 : AA —> R such 
that 2 (A) represents the cost reduced by assignment A. More 
formally, 2 (A) = c(T, 0) — c(T, A) for each A £ AA. 

The above matrix representation plays a crucial role in 
Greedy Assigning Stage when we prove the quality of the 
stat e-nod e assignment based on Matroid Theory later in Sec¬ 
tion [WB] This stage starts from a branch state node assignment 
0 and cost c(T, 0), and iteratively assigns one branch state 
node for a tree in T until no more assignment can reduce 
c(T, A). More precisely, in each iteration, if the present branch 
state node assignment is A £ AA, we choose an element x in 
N — A such that: 1) A U {x} is in AA and follows the node 
capacity constraint, and 2) z(A U {a;}) = max yg (jv-A) z(A U 
{y}). In other words, the first condition guarantees that the 
new assignment is feasible, whereas the second condition 
chooses the node leading to the maximal reduction on c(T, A). 
Afterward, Local Search Stage first adjusts the assignment 
of branch state nodes for overloaded nodes iteratively. In 
each iteration, we first extract an overloaded node u and 
then compute the reduction of the bandwidth cost with a 
branch state node assigned to u for each tree T spanning u, 
assuming that the state-node assignment of other nodes are 
not changed. Afterward, this phase sorts the trees according 
to the bandwidth reduction and chooses the b u trees with the 
largest reduction, whereas the branch state nodes are assigned 
to them accordingly. This stage is repeated until all overloaded 
nodes are carefully examined. Afterward, this stage reroutes 
the paths from other branch nodes of a tree in order to find a 
smaller tree with the same assignment of branch state nodes. 
More specifically, for any branch node u in tree Tj, we choose 
nodes v and w of Tj in the same way as the Multi-Tree Routing 
Phase in order to find a new path from w to v, and w is not 
full. 

Example. In Greedy Assignment Stage of the State-Node 
Assignment Phase, w hen th ere is no branch state node, the total 
bandwidth cost in Fig. |3(c)| is c(T, 0 ) = c(Ti, 0 )+c(T 2 , 0 ) = 
142 + 92 = 234. If we assign a branch state node on v for tree 
Ti, the bandwidth cost of the path s,a,u,v can be reduced 
by (4—1) times since there are 4 downstream destination 
nodes d^,di,d^, de of v in T\. The reduced cost is the largest 
among all possible branch state node assignments. Therefore, 
MTRSA first assigns a branch state node on v for tree T\ 
with the cost reduced by (4 — 1) x ( k Sia + k a , u + k U} v) = 63. 
It then assigns a branch state node on y to T 2 with the cost 
reduced by (3 — 1) x (k S:W + k WtV ) = 30. Afterward, node a is 
assigned as a branch state node for T 2 with the cost reduced by 
















Fig. 3. An example of MTRSA 


(3 — 1) x (k s ,a) = 18, and node c is assigned as a branch state 
node for T 2 with the cost reduced by (2 — 1) x ( k a ,b+kb, c ) = 7. 
In Local Search Stage, there are three overloaded nodes a, c, 
and y. For overloaded node a, this phase moves the branch 
state node on a from T 2 to T\ without changing the branch 
state nodes of the other nodes. If we assign a branch state 
node on a to T±, it becomes possible to reduce the cost of T\ 
by (3 — 1) x ( k Sia ) = 18. In contrast, if we assign a branch 
state node on a to T 2 , we are able to reduce the cost of T 2 by 
only (2 — 1) x (k S! a) = 9. No des c and y are then processed 
similarly. Finally, in Fig. |3(d)| since node y has been a branch 
state node, node c can be re-routed to node y in T 2 , an d the 
total bandwi dth co nsumption is reduced from 107 in Fig. |3(c)| 
to 93 in Fig. |3(d)| accordingly. 

B. Approximation Ratio and Time Complexity 

In the following, we first examine the quality of assignment 
for branch state nodes in the second phase. We prove that 
(N, M) is a matroid and the set function z : M —> M is 
a nondecreasing submodular set function. Therefore, accord¬ 
ing to the Matroid Theorem for maximizing submodular set 
function [1251 , we have the following theorem. 

Theorem 3. The number of branch state nodes reduced by 
the Greedy Assignment Stage is at least one half of the branch 
state nodes reduced by the optimal assignment of branch state 
nodes. 

Proof: In the following, we first prove that Ad is a 
matroid. M is the family of subsets of N = (1,..., t} x W 
(i.e., we put the elements of N in a t x \W\ array) such that 
the elements in the i-th row are in the columns indexed by 
Wi, and the number of elements in the column indexed by w 
is at most the capacity of w. Hence, by definition, we have: 
1) 0 G M, 2) If A C B £ M, then A £ M, and 3) If 
A, B £ Ad with |A| < \B\, then there is an element b £ B 
such that A U {6} £ AT Therefore Ad is a matroid. 

Now we prove that the set function z : Ad C 2^ R is 
submodular and nondecreasing. Let A,B £ Ad with A C B 
and c £ N be the element in row i and column w such that 
A U {c}, B U {c} £ Ad, since z(A U {c}) — z(A) is the cost 
reduced by assigning a branch state node in node w to tree 7) 
with branch state node assignment A, and since z(B U {c}) — 


z(B) is the cost reduced by assigning a branch state node in 
node w to tree 7) with branch state node assignment B, we 
have z(A U {c}) — z(A) > z(B U {c}) — z(B). Hence, 2 is 
submodular. Let A,B £ Ad with A C If by definition of z, 
we have z(A) < z(B), and z thereby is nondecreasing. 

Let Zopt be ma x{z(A) : A £ Ad} and Zq be the result 
from our algorithm. By a result on maximizing submodular set 
function on matroid ll25l . we have ^ OFT ~,^ < 77 . Hence, 
Zg > \Zqpt- The theorem follows. ■ 

Then, we prove that MTRSA is a ^-approximation al¬ 
gorithm for SMTE-N, where 5 is the maximum size of the 
destination sets. Since Theorem |T| proves that there is no 
approximation algorithm with ratio S 1 ~ £ for any e > 0 , 
the following theorem shows that MTRSA achieves the best 
approximation ratio. In contrast, since SMTE cannot be ap¬ 
proximate within any ratio unless P = NP, it is impossible 
to derive an approximation ratio for any algorithm of SMTE, 
and we thereby evaluate MTRSA for SMTE in Section [VT1 

Theorem 4. MTRSA is a 5-approximation algorithm for 
SMTE-N, where 5 = maxi <j< t IAI- 

Proof: Let the set of multicast trees T* = (X*...., T *) 
with the assignment A* of branch state nodes be the optimal 
solution to SMTE-N, and W* = U \-fW* with W* as the 
set of branch nodes of T*, whereas A* = {w £ W* : 
the (i,w) entry of A* is 1} be the set of branch state nodes 
in T*. Therefore, the optimal bandwidth cost is c(T*. A*) = 
£* =1 c(T*. A*). For MTRSA, Multi-Tree Routing Phase first 

constructs the shortest-path trees TA = (T^,..., X^A), 
and the rerouting procedure of the Multi-Tree Phase outputs 
the new trees T* . MTRSA finally generates the trees 7" A 
with the assignment TP 3 -* of branch state nodes. Let A be 
an assignment such that for each non-overloaded node u 
within T (1) , MTRSA assigns a branch state node on u to 
each tree in T l 1 ■' with u as a branch node. According to the 
State-Note Assignment Phase, we have A C /A 3 L On the 
other hand, in Multi-Tree Routing Phase, MTRSA updates the 
trees only when the bandwidth cost does not increase, and 
the branch state nodes can only reduce the cost. Therefore, 
we have c(T( 3 \A( 3 )) < c(fr^\A^>) < c(T^ 2 \A). In the 
rerouting procedure of Multi-Tree Routing Phase, suppose we 
reroute T to T'. Since each rerouting step does not create 
any new overloaded node, and the new path in T' ensures 
that c(T',A) < c(T, A). Therefore, c(T^ 2 \A) < c{T^\A) 
holds by induction, and we have c{T^ 3 \ A^) < c(T* 2 \ A) < 
c(T ( - 1 \ A) < c(T ( - 1 \ 0). Since the path P^h from the source 
Si to node d in Di in tree Tj 1 ' 1 is the shortest path from Si to d 
in G, and T* has a path from ,s,; to d, we have c(P^ d ) < c(T* ) 
for every i and every d £ Di. Therefore c(T^ 3 \A^) < 
c(T (1) ,0) = £-=i c(lf\0) < TUT^dA P s%) < 
£ UE d e Di c(T*,A*) = £- =1 |A| x c(T*, A*) < 
5(£* = i c(T*,A*)) = 5 x c(T*,A*). The theorem follows. 

■ 

Time Complexity. We first find the shortest path be¬ 
tween any two nodes in G with Johnson’s algorithm in 
0 (|H||.E| + |V | 2 log |H|) time as the pre-processing procedure. 
Multi-Tree Routing Phase constructs the shortest-path tree for 
each source ,s,; and its corresponding destination set I),, and 
MTRSA compares the distance from a destination node to all 
other nodes in 0(|V|) time. Processing all d £ Dj requires 
0(|J7| A|) = 0(5|V|) time, and processing all shortest-path 
trees requires 0(t6\V\) time. After constructing the shortest- 













path trees, MTRSA reroutes the paths from each overloaded 
node to some of its downstream nodes. Since there are at most 
tS branches in the tree set, we reroute at most 5 paths for each 
branch node, and each rerouting requires the comparison of at 
most |V'l distances of paths. Therefore, the above procedure 
requires 0(tS 2 \V\ log |H|) time, and Routing Phase requires 
0(tS 2 \V\ log |V'l) time. 


Afterward, in Greedy Assigning Stage of State-Node As¬ 
signment Phase, there are at most t\V\ branch state nodes 
required to be assigned, and this stage has at most t\V\ 
iterations. In each iteration, we first derive the minimum cost 
reduced by assigning a branch state node on each node v of 
every X) in 0(t\V\) time, and we update the cost reduced 
by assigning each new branch state node to the tree T) in 
0(|Tj||i?|) = 0(|Vj|.E|) time. Therefore, this stage requires 
0(t\V\(t\V\ + \V\\E\)) = 0(t\V\ 2 (t+\E\)) time. Then, Local 


Search Stage carefully examines the overloaded nodes. In each 
iteration, we adjust the branch state nodes on each node u 
in different trees without changing other branch state nodes 
to find the new bandwidth cost in 0(t\E\) time, and this 
stage takes 0(f|Vj|i?|) time because there are at most \V\ 
iterations. Therefore, State-Node Assignment Phase requires 
0(t\V\ 2 (t + |£j)) time to allocate the branch state nodes and 
0(tS 2 \V \log |Vj) time for rerouting, and MTRSA requires 
0{t8\V\ 2 log \V\{t + |i?|)) time. 


C. Extension to SMTE 


For SMTE, since the number of times that each packet 
is delivered in a link cannot be acquired before assigning 
the branch state nodes, we first present the concept of weak 
edge capacity constraint. Let e\ u v = 1 if £i, u ,v is a positive 
integer in our Integer Programming formulation, and e[ u v = 0 
otherwise. MTRSA needs to ensure J2i<i<t ft x £ 'i,u v — c u,v 
holds for Ve U) „ £ E before assigning the branch state nodes. 
In addition, in the general case of SMTE, the storage size of 
each branch state node u in Group Table is proportional to the 
degree of u in the corresponding multicast tree m. Therefore, 
let f3i tU denote the node weight (i.e., storage size) of assigning 
a branch state node on u to tree Ti, and b u here denotes the 
size of Group Table in u. For SMTE, we extend MTRSA as 
follows. 


Before Multi-Tree Routing Phase, we sort the multicast 
trees in T according to their data rates in the ascending order, 
/i < f‘z < • • • < ft . I n Multi-Tree Routing Phase, we find the 
first shortest-path tree 7) in T and decrease the link capacity 
c u , v for every edge in T\ by its flow rate /j. Note that any 
edge e u , v with insufficient residual capacity to support fy will 
be removed since it cannot support the rest of the multicast 
flows. The above procedure is repeated for other trees in T. 

In the rerouting procedure of Multi-Tree Routing Phase, 
we reroute each multicast tree Ti according to the weak edge 
cap acity c onstraint, such that any new path from w to v in Sec- 
tion lV-Al must have sufficient capacity to support f,. In Greedy 
Assignment Stage, we find an element x = (i,u) in N — A 


according to 


z(AU{(i,u)})—z(A) _ 


= max{ 


z(AU{(« / ,'U / )}) — z(A) 

T 


A U {( i',u ')} £ Af}, which represents the normalized cost 
reduction. In other words, the node weight is considered 
during the assignment of branch state nodes. 


In Local Search Stage, optimizing the state-node assign¬ 
ment of one node becomes the same as the knapsack problem 
because each candidate branch state node now has a profit (i.e., 
cost reduction) and a size (i.e., node weight), and we exploit 
Polynomial-Time Approximation Scheme for knapsack (26) to 
find the solution. In the rerouting procedure of Local Search 


Algorithm 1 Multi-Tree Routing and State Assignment Algo- 
rithm (MTRSA)_ 

Require: A network G = (V, E), source nodes si, S 2 ,..., St, 
destination sets D i, £>2, • • • , Dt, and State-Node table A. 
Ensure: Multicast trees , T), Sj is the root of 7), 

and Di is in Ti. 

1 : //Multi-Tree Routing Phase 
2 : for * £ {1, 2 , ...,£} do 

3: Ti <r- shortest path tree containing 77, with root Si 

4: 

5: for overloaded node u do 

6 : Reroute an appropriate downstream node v of u to 

balance the distribution of branch nodes 

7: 

8 : //State-Node Assignment Phase 1) Greedy Assigning Stage 
9: A <- [0] 

10 : while there is x £ N — A such that A U {x} £ M do 
11 : x max £- arg max {Z(A U {x}) : A U {x} £ M} 

xCzN—A 

12 : A i A U {x max } 

13: 

14: //State-Node Assignment Phase 2) Local Search Stage 
15: for overloaded node u do 

16: Re-assign node state on u to maximizing reduction 

17: 

18: for overloaded node u do 
19: while node u is overloaded do 

20 : for each node 2 in V do 

21 : if node z is not overloaded then 

22 : Reroute node the downstream node v of u to 

node z 

23: Break the for loop 

24: return Ti, T%, ■ ■ ■ ,T t and A 


Stage, since now the branch state nodes have been specified, 
we reroute each multicast tree T) according to the original 
edge capacity constraint (7), such that any new path from w 
to v must have sufficient capacity to support f,. If the amount 
of multicast flows in any edge exceeds the capacity constraint, 
we also reroute its closest upstream state node it in a tree T 
to w, such that the new path from w to v follows the link 
capacity constraint. 

MTRSA can support the dynamic multicast group mem¬ 
bership as follows. When a user v joins or leaves a multicast 
group, MTRSA adds or trims (if no other users are located 
downstream to the user) the corresponding branch from the 
upstream branch node u in the same way as Multi-Tree Rout¬ 
ing Phase. Afterward, State-Node Assignment Phase adjusts 
the new branch state node if necessary. Therefore, it does not 
need to re-compute the whole tree. 

D. Pseudo Code 

The pseudo code of MTRSA is shown in Algorithm Q] 

VI. Performance Evaluation 

In this section, we first compare MTRSA and other 
approaches with real topologies. Afterward, we deploy our 
algorithm in a small experimental SDN network with HP SDN 
switches to evaluate the video performance with YouTube HD 
traffic that requires a large amount of bandwidth consumption.. 

A. Simulation Setup 

We simulate our algorithm in two real networks: Vtl- 
Wavenet2011 and Columbus l27i . VtlWavenet2011 includes 
91 nodes and 96 links, while Columbus has 70 nodes and 85 
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links. Our simulation is divided into small-scale and large- 
scale cases. The numbers of trees in the small-scale cases are 
smaller than 100, whereas there are more than 2000 trees in the 
large-scale cases. The link capacity in the topologies is set to 
the level that the maximal bottleneck link utilization reaches 
100% Il28l , and the edge cost of each link is set as 1. We 
vary the number of multicast trees, the number of destinations, 
and node capacity. The source and destinations are chosen 
randomly from each network. 

We compare MTRSA with the following algorithms: 1) 
the shortest-path tree algorithm (SPT), 2) the Steiner tree (ST) 
algorithirQ |7), and 3) CPLEX | |29l , which finds the optimal 
solutions of SMTE problem by solving the IP formulation in 
Section [III] In SPT and ST, the branch state nodes of different 
trees are randomly assigned to a branch node when node is 
fully utilized, i.e., the number of branch state nodes reaches 
the node capacity. Each SPT and ST is added to the network 
iteratively. If an edge does not have sufficient residual capacity 
to support the multicast flow of a new SPT or ST, it will be 
removed accordingly to avoid choosing the edge in the SPT or 
ST. We implement all algorithms in an HP DL580 server with 
four Intel Xeon E7-4870 2.4 GHz CPUs and 128 GB RAM. 
Each simulation result is averaged over 100 samples. 

B. Small-Scale Evaluation 

In the small-scale cases, we compare the total bandwidth 
costs of all trees in MTRSA, SPT, ST, and CPLEX with dif¬ 
ferent number of trees (|T|), different node capacity (b u ), and 
different number of destinations \D\. As shown in Fig. |4]and 
Fig. [5] MTRSA generates a solution with the total bandwidth 
cost very close to the optimal solution. Although SPT chooses 
the shortest path to the destinations, it does not carefully 
examine the node capacity, and its cost is thus higher than 
MTRSA. Compared with SPT, the distance from the source to 
a destination in ST is usually higher because the path needs to 
be deviated from the shortest one in order to share more edges 
with another path. Therefore, more branch nodes are usually 
involved in ST. Without a sophisticated allocation of branch 
state nodes, ST incurs a slightly higher cost than SPT due to the 
additional bandwidth consumption in unicast tunneling for the 
branch nodes with full Group Table. The difference becomes 
more significant when the number of trees increases as shown 
in Fig. 0] Similarly, Fig. [5] manifests that the total bandwidth 
cost of each tree increases as the number of destinations grows, 
because each tree becomes larger in this case. 

C. Large-Scale Evaluation 

In the following, we evaluate MTRSA, ST, and SPT in 
larger-scale cases, where the number of multicast tree is 
ranged from 2000 to 10000, the number of destinations is 
from 5 to 25, and the node capacity is between 50 and 250. 
Compared with smaller-scale cases, the advantage of MTRSA 


3 There are some single-tree multicast routing algorithms with differernt 
purpuses (such as QoS), but they are not included in this study because ST 
(i.e., the optimal solution for single tree) outperforms those approaches in 
terms of the bandwidth consumption. 
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is more significant in larger-scale cases. In Fig. [6] the total 
cost increases with the number of trees. For a larger network, 
the source and any destination are inclined to be located with 
distantly, but there is also a higher chance to find a node with 
sufficient capacity as a branch node for rerouting. Therefore, 
MTRSA effectively reduces the total bandwidth cost by 66% 
and 59%, respectively, compared to ST and SPT. In addition. 
Fig. [7] shows that the bandwidth costs can be effectively 
reduced when we increase node capacity, and setting the node 
capacity as 100 is sufficient for MTRSA. On the other hand, 
the bandwidth cost grows with the number of destinations, 
because all trees are required to span more nodes as shown in 
Fig.® 

Table I summarizes the running time of MTRSA with 
different |T| and \D\. With a smaller input, such as 2000 trees 
and 5 destinations in each tree, the running time for MTRSA 
is around 1 second. As |T| and \D\ increase, MTRSA only 
requires around 73 seconds in the largest case with 10000 
multicast trees. Therefore, it is envisaged that our algorithm is 
practical to be deployed in SDN. 































o 


2500 
2000 
1500 
1000 
500 

100 200 300 400 



Time (s) 


(b) Bandwidth consumption 


Fig. 9. Implementation results of the experimental SDN 


D. Implementation 

To evaluate MTRSA in real environments, we implement it 
in an experimental SDN with HP Procurve 5406zl OpenFlow- 
enabled switches. We use Floodlight as the OpenFlow con¬ 
troller to install the multicast forwarding rules in SDN-FEs. 
We install multicast group information in group table and 
create virtual ports mapping to multiple physical ports to 
forward multicast traffic. MTRSA is running on the top of 
Floodlight. Our testbed includes 12 nodes and 24 links as 
shown in Fig. |9j where the link capacity and node capacity 
are set as 50Mbps and 5, respectively. We randomly select 10 
nodes as the video multicast sources, where each source is 
connected to a Youtube proxy to facilitate YouTube multicast. 
We implement the Youtube proxy by using VLC player, which 
can request video stream from Youtube and work as a video 
server. To support multicast, we modify TCP to aggregate TCP 
ACKs from multiple clients. The full-HD test video is in 460 
seconds encoded in H.264 with the average bit rate as 10Mbps. 
For each source, we randomly assig n 10 destinations that play 
videos using the VLC player. Fig. |9(b)| shows that the total 
bandwidth consumption during playback, and we average the 
bandwidth consumption every 40 seconds. The results manifest 
that the bandwidth consumption of MTRSA is 46% and 35% 
lower than ST and SPT, respectively. Therefore, MTRSA can 
effectively support multicast traffic engineering in SDN. 

VII. Conclusion 

Recent studies on traffic engineering for SDN mostly focus 
on unicast, while most existing multicast routing algorithms 
are designed to find the routing of a multicast tree, instead of 
multiple trees. In this paper, therefore, we have formulated 
Scalable Multicast Traffic Engineering Problem (SMTE) to 
minimize the total bandwidth cost according to the link and 
node capacity constraints for multiple trees in SDN. We have 
proved that SMTE-N is NP-hard and not able to be approxi¬ 
mated within 5, while SMTE cannot be approximated within 
any ratio. To solve the problem, we have proposed Multi-Tree 
Routing and State Assignment Algorithm (MTRSA), which is 
a ^-approximation algorithm for SMTE-N, while MTRSA has 
been extended to support SMTE as well. Simulation based 
on real topologies and implementation with Youtube traffic 
manifest that MTRSA can effectively find the routing of 
multiple multicast trees and assign the branch state nodes in 
order to reduce the total bandwidth cost, while the computation 
time to construct numerous trees is also reasonable for practical 
SDN. Since the tree obtained by MTRSA is not delay bounded, 
we will extend it to support QoS multicast in the future work. 
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